CDS
Accession Number | TCMCG075C10316 |
gbkey | CDS |
Protein Id | XP_007038985.2 |
Location | complement(join(29687208..29687343,29687851..29689500,29690208..29690632,29690886..29690930)) |
Gene | LOC18605731 |
GeneID | 18605731 |
Organism | Theobroma cacao |
Protein
Length | 751aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007038923.2 |
Definition | PREDICTED: plastidial pyruvate kinase 4, chloroplastic [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGACGGCTTCTATAGTTACGAATCACAAGGCTATTCCACGCCATGCTATCCACACAGCTGATGTTTCGGAATTGGGATTATGTACCCTGGGGGGGAAGTTTGCATATCTGCCTTGCAGACTTAATGTAAGACAGTCAGCGCAAATGGTTCAATTATTAGCAAAATTCAGGAACACCCTCGCTCGAAAGACAACAGCTTTTGCTATTCCAAATGAAAACAATGAAGTTGAAAGAAGTGGTTCGCATGCTTGTTCCGATGATCAAGTGCTCATGCCTTTGGAAAATTACAATAGTTCCGGTCATCTGGAAGGGGAAGCTGTTGATTCTCTTTCAAAGTCAGAAGCAGGATTTTTTCAGAGGGTTGAGCACTTGGGAAACCAGGCAAGGGTTCTTGACAAATTGAGGGCTGTATATTTACATGTGTTAGCATCGGAACAATGGAATGCTTCTAGCCTGAAGCTATCTCACAAAAATTACATGGAGAGTGCAACAAACTTAATCCATTATCTGGCTTTGAAATCTCTTGACACTGAAGCACTCAAAGATGATCTTGCTTTGATCAGTCTGTTGAATTTGGAGATGGTCAATTCATCTGTCCTAGCAAGTCTAACCACAGGCATCCAACTGTTAGAAAACCTACAATTAAATTCTGTAAGGGCTATTGGGAATGTTAGTGCTGAAATTTGCATGCAGGAAAAATTAGATCAGCAAAATAAAGGGAATTTCATGATAAATGCAATGAAGAAGAAGGCATTTTTAAATAGAGAGTTATTATTGGGACCACTTCAAGATAGCAGACTTACTCATATCATGACAACAGTTGGCGAAGAGGCTCTTGAGAGTGAAACACTAATAACCAATCTTATAAAGGCTGGGACTTCTATTATTCGAATCAATTGTGCACATGGAAACCCGCAACTTTGGAGTGAGATAATCAGAAGGGTAAAACAAAGTTCTCAAATGCTGGAGTCACCATGTCGAATTCTTATGGATTTAGCCGGACCAAAACTTCGTACAGATAATCTAAAGCCTGGTCCATGTGTGGTAAAAATATCTCCAAAGAAAAATGCTTCTGGAAATGTGATTTTTCCTGCACAAGTTTGGCTTTCTCACAAAGGAGCCGGCCCGCCTCCTCCTCATCTTTCTCCTGATGCAGTTCTGTTCATAGATGACCAAGAATTTCTCACTGAGGTCAAAGTAGGTGATACCTTGAGGTTCTTTGATGCTAGAGGTAAGAAAAGGATGCTAAAGATCTCCAGAGTCTTTCACATTTTTTCAGGTACTGGTTTTATGGCTGAGTGTACTAGGACTGCTTATGTTAGTTCTGGAACCGAATTACTTATTAAGAGGAAGAAAGGTAGGTTCCTTGTTGGACAAGTGGTAGATGTCCCTGCTAGAGAGTCATTTATCAGACTAAGAGTTGGAGACTTGCTAATTATATTGCGGGATGGTAAGTCTGACCAGGATAACTCCTATGGGCACACAAGTCGTGCTTATAGAATAGCATGTTCATCAGGCTATCTGTTTGATGCAGTCAAACCTGGAGAGCGCATAGCTTTTGATGATGGAAAGATTTGGGGAGTCATCAAGGGAACTAGTAGTTCAGAGATTGTTGTCTCGATTACTCATGCTGGCCCAAGAGGGACTAAACTTGGATCACAGAAATCCATCAACATTCCAGACAGCAATATTCGGTATGAAGGTCTGACTTCAAAGGATCTGGTGGATCTCGAATTTGTTGCTTCCCATGCAGACATGGTGGGTGTTTCATTTGTTCGAGATACTCGTGATGTTATTGTACTTCGCCAAGAACTGGAGAAAAGGAAACTTCAGAACTTGGGGATTGTTTTGAAAATTGAAACAAAAAGTGGGTTTGAGAAATTGCCCCTGTTGCTCTTGGAGGCAATGAAGTCTTCAAATCCTTTAGGGGTTATGATTGCCAGAGGAGATCTTGCAGTAGAGTGTGGCTGGGAAAGATTGGCTGATATACAAGAG
GAAATATTGTCTGTTTCTGGCACTGCTCACATACCGGTTATTTGGGCAACTCAGGTCCTGGAATCACTTGTCAAATCTGGTATTCCTACCAGAGCTGAGATTACTGATGTTGCAAATGGAAGGAGGGCAAGCTGCATTATGTTGAATAAAGGGAGACACATTGTACAAGCTGTTTCAACTCTAGACAGCATCCTCCGGGCTAACTCCAAGGAGATGAAAGCCGAACGGAAGCCTCTTGTTCTGTCCAGCCATCTCTTTTAG |
Protein: MTASIVTNHKAIPRHAIHTADVSELGLCTLGGKFAYLPCRLNVRQSAQMVQLLAKFRNTLARKTTAFAIPNENNEVERSGSHACSDDQVLMPLENYNSSGHLEGEAVDSLSKSEAGFFQRVEHLGNQARVLDKLRAVYLHVLASEQWNASSLKLSHKNYMESATNLIHYLALKSLDTEALKDDLALISLLNLEMVNSSVLASLTTGIQLLENLQLNSVRAIGNVSAEICMQEKLDQQNKGNFMINAMKKKAFLNRELLLGPLQDSRLTHIMTTVGEEALESETLITNLIKAGTSIIRINCAHGNPQLWSEIIRRVKQSSQMLESPCRILMDLAGPKLRTDNLKPGPCVVKISPKKNASGNVIFPAQVWLSHKGAGPPPPHLSPDAVLFIDDQEFLTEVKVGDTLRFFDARGKKRMLKISRVFHIFSGTGFMAECTRTAYVSSGTELLIKRKKGRFLVGQVVDVPARESFIRLRVGDLLIILRDGKSDQDNSYGHTSRAYRIACSSGYLFDAVKPGERIAFDDGKIWGVIKGTSSSEIVVSITHAGPRGTKLGSQKSINIPDSNIRYEGLTSKDLVDLEFVASHADMVGVSFVRDTRDVIVLRQELEKRKLQNLGIVLKIETKSGFEKLPLLLLEAMKSSNPLGVMIARGDLAVECGWERLADIQEEILSVSGTAHIPVIWATQVLESLVKSGIPTRAEITDVANGRRASCIMLNKGRHIVQAVSTLDSILRANSKEMKAERKPLVLSSHLF |
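The Location field above uses the GenBank feature-location syntax: `join(...)` lists the exon intervals (1-based, inclusive) and `complement(...)` indicates the reverse strand. As a minimal sketch of a consistency check, the spliced CDS length can be compared against the stated protein length of 751 aa plus one stop codon. The helper names `parse_location` and `spliced_length` are hypothetical and used for illustration only; the regular expression assumes simple `start..end` intervals with no partial-boundary markers such as `<` or `>`.

```python
import re

def parse_location(location: str):
    """Parse a simple GenBank-style location string.

    Returns (is_complement, segments), where segments is a list of
    (start, end) tuples with 1-based inclusive coordinates.
    Assumes plain 'start..end' intervals only (illustrative helper).
    """
    is_complement = location.strip().startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments

def spliced_length(segments):
    """Total nucleotides covered by all segments (inclusive coordinates)."""
    return sum(end - start + 1 for start, end in segments)

# Location string copied from the record above.
location = (
    "complement(join(29687208..29687343,29687851..29689500,"
    "29690208..29690632,29690886..29690930))"
)

is_complement, segments = parse_location(location)
cds_length = spliced_length(segments)

# A complete CDS should encode the protein plus one stop codon.
protein_length = 751
expected_length = (protein_length + 1) * 3

print(is_complement, len(segments), cds_length, expected_length)
```

For this record the four intervals sum to 2256 nt, which matches (751 + 1) × 3, so the coordinates, the protein length, and a terminal stop codon are mutually consistent.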